Testing homogeneity of a large data set by bootstrapping

نویسندگان

  • K. Morimune
  • Yohei Hoshino
چکیده

It is not rare to analyze large data sets these days. Large data is usually of census type and is called the micro data in econometrics. The basic method of analysis is to estimate a single regression equation with common coefficients over the whole data. The same applies to other method of estimation such as the discrete choice models, Tobit models, and so on. Heterogeneity in the data is usually adjusted by the dummy variables. Dummy variables represent socioeconomic differences among individuals in the sample. Including the coefficients of dummy variables, only one equation is estimated for the whole large sample, and it is usually not preferred to divide the whole sample into sub-samples. Data is said to be homogenous in this paper if a single equation is fit to the whole data, and it explains socioeconomic properties of the data well. We may estimate an equation in each sub-population if the whole population is divided into known subpopulations. It is assumed that the coefficients are different from one sub-population to another in this case. Data is said to be heterogeneous in our paper. The analysis of variance is applied if sub-populations are known and sub-sample is collected from each subpopulation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Robust Bootstrap Algorithm for the Assessment of Common Set of Weights in Performance Analysis

The performance of the units is defined as the ratio of the weighted sum of outputs to the weighted sum of inputs. These weights can be determined by data envelopment analysis (DEA) models. The inputs and outputs of the related (Decision Making Unit) DMU are assessed by a set of the weights obtained via DEA for each DMU. In addition, the weights are not generally common, but rather, they are ve...

متن کامل

An Integrated DEA and Data Mining Approach for Performance Assessment

This paper presents a data envelopment analysis (DEA) model combined with Bootstrapping to assess performance of one of the Data mining Algorithms. We applied a two-step process for performance productivity analysis of insurance branches within a case study. First, using a DEA model, the study analyzes the productivity of eighteen decision-making units (DMUs). Using a Malmquist index, DEA deter...

متن کامل

Weighted tests of homogeneity for testing the number of components in a mixture

An important but di-cult problem in .tting .nite mixture models is estimating and testing the number of components in the mixture. Regularity conditions do not hold for large sample likelihood theory so that likelihood ratio tests cannot easily be implemented. However, a number of homogeneity tests have been developed to test for the presence of a mixture. Weighted versions of homogeneity tests...

متن کامل

Nonparametric Estimation and Testing in Panels of Intercorrelated Time Series

We consider nonparametric estimation and testing of linearity in a panel of intercorrelated time series. We place the emphasis on the situation where there are many time series in the panel but few observations for each of the series. The intercorrelation is described by a latent process, and a conditioning argument involving this process plays an important role in deriving the asymptotic theor...

متن کامل

Bootstrap Procedures for Testing Homogeneity Hypotheses

Before pooling data on effect sizes (a generic term for parameters of interest in the context of meta-analysis) from different studies, it is important to test for homogeneity of the effect sizes. A well known test for homogeneity is based on Cochran’s chisquare statistic. Our recent investigation showed that when the effect size of interest is a pairwise correlation, Cochran’s homogeneity test...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Mathematics and Computers in Simulation

دوره 78  شماره 

صفحات  -

تاریخ انتشار 2008